40
Algorithms for Binary Neural Networks
Feature Maps
Reconstructed
Filters
Feature Maps
Center Loss
Unbinarized
Filters
Average
Feature Maps
MCConv
Binarized
Filters
Binarize
Filter Loss
Loss Function
Softmax Loss
M-Filters
Modulation Module
(MCconv layer)
Loss layer
forward propagation
backward propagation
MCconv
MCconv
Feature
Maps
Average
Feature
Maps
Output
Fully Connected
Layers
Convolution
Layers
Input
Cat
FIGURE 3.1
The overall frameworks of Modulated Convolutional Networks (MCNs).
3.4
MCN: Modulated Convolutional Network
In [199], XNOR-Network is presented where both the weights and inputs attached to the
convolution are approximated with binary values, which allow an efficient implementation
of convolutional operations, i.e., particularly by reconstructing unbinarized filters with a
single scaling factor and a binary filter. It has been theoretically and quantitatively demon-
strated that simplifying the convolution procedure via binarized filters and approximating
the original unbinarized filters is a very promising solution for CNNs compression.
However, the performance of binarized models generally drops significantly compared
with using the original filters. It is mainly due to the following reasons: 1) The binarization
of CNNs could be solved based on discrete optimization, which has long been neglected in
previous works. 2) Existing methods do not consider quantization loss, filter loss, and intr-
aclass compactness in the same backpropagation pipeline. 3) Rather than a single binarized
filter, a set of binarized filters can better approximate the full-precision convolution.
As a promising solution, Modulated Convolutional Network (MCN) [236] is proposed as
a novel binarization architecture to tackle these challenges toward highly accurate yet robust
compression of CNNs. Unlike existing work that uses a single scaling factor in compression
[199, 159], we introduce modulation filters (M-Filters) into CNNs to better approximate
convolutional filters. The proposed M-Filters can help the network fuse the feature in a
unified framework, significantly improving the network performance. To this end, a simple
and specific modulation process is designed that is replicable at each layer and can be easily
implemented. A complex modulation is also bounded as in [283]. In addition, we further
consider the intraclass compactness in the loss function and obtain modulated convolutional
networks (MCNs) 1. Figure 3.1 shows the architecture of MCN. MCNs are designed based
on binarized convolutional and modulation filters (M-Filters). M-Filters are mainly de-
signed to approximate unbinarized convolutional filters in the end-to-end framework. Since
an M-Filter (matrix) can be shared at each layer, the model size of MCNs is marginally in-
1The work has been commercialized.